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We present a search for the standard model (SM) Higgs boson produced in association with a Z 
boson in 9.7 fb~^ of pp collisions collected with the DO detector at the Fermilab Tevatron Collider 
at = 1.96 TeV. Selected events contain one reconstructed Z — >■ e"'"e~ ot Z M^A*" candidate 
and at least two jets, including at least one jet likely to contain a b quark. To validate the search 
procedure, we also measure the cross section for ZZ production, and find that it is consistent with 
the SM expectation. We set upper limits at the 95% C.L. on the product of the ZH production 
cross section and branching ratio B{H bb) for Higgs boson masses 90 < Mh < 150 GeV. The 
observed (expected) limit for AIh = 125 GeV is a factor of 7.1 (5.1) larger than the SM prediction. 



I. INTRODUCTION 



In the standard model (SM), the spontaneous break- 
ing of the electroweak gauge symmetry generates masses 
for the W and Z bosons and produces a new scalar el- 
ementary particle, the Higgs boson Precision elec- 
troweak data, including the latest W boson mass mea- 
surements from the CDF 2] and DO Q collaborations 
and the latest Tevatron combination for the top quark 
mass constrain the mass of the SM Higgs boson to 
Mh < 152 GeV Q at the 95% confidence level (C.L.). 
Direct searches at the CERN e+e" CoUider (LEP) 6], by 
the CDF and DO collaborations at the Fermilab Tevatron 
pp ColUder 0, and by the ATLAS and CMS collabora- 
tions at the CERN Large Hadron ColHder (LHC) d, 
further restrict the allowed range to 122.1 < AI^ < 127.0 
GeV. ATLAS and CMS have discovered a new boson with 
properties consistent with those of the SM Higgs boson 
at AIh ~ 126 GeV [13) Ell, primarily through its decays 
into 77 and ZZ, while the CDF and DO collaborations 
have reported combined evidence for a particle consis- 
tent with such a boson produced in association with weak 
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bosons and decaying to bb [l2l |. 

For Mh ^ 135 GeV, the dominant Higgs boson decay 
is to the bb final state. At the Tevatron the best sen- 
sitivity to a low mass Higgs boson is obtained from the 
analysis of its production in association with a W or Z 
boson and its subsequent decay into pairs of b quarks. 
Evidence for a signal in this decay mode complements 
the ATLAS and CMS observations and provides further 
indication that the new particle is consistent with the SM 
Higgs boson that also couples directly to fermions. 

We present a search for the process ZH — > t'^d^hh, 
where £ is either a muon or an electron, in 9.7 fb~^ oipp 
collisions at -y/s ~ 1.96 TeV using the DO detector. This 
Article is a detailed description of a published Letter fT3| 
providing inputs included in the CDF and DO combina- 
tion described in Ref. [l^]. The CDF collaboration has 
performed a search in the same final state [l3]. This 
analysis extends and supersedes the previous DO result 
obtained on 4.2 fb^^ of integrated luminosity 15]. 

We select events that contain a Z boson candidate, re- 
constructed in one of four independent channels defined 
by lepton identification criteria. Selected events must 
also contain a Higgs boson candidate, reconstructed from 
two jets. At least one jet must be identified as likely to 
originate from a b quark ("6 tagged"). The backgrounds 
to this selection include the production of a Z boson in 
association with jets, ti production, diboson production, 
and multijet events with non-prompt muons or electrons, 
or with jets misidentified as electrons. They are esti- 
mated using Monte Carlo (MC) simulations and control 
samples in the data. We employ a kinematic fit to im- 
prove the reconstruction of the H ~> bb resonance. Sub- 
sequently, we develop a two-stage multivariate analysis 
to separate the signal from the backgrounds and extract 
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results from the shapes of the resulting multivariate dis- 
criminants. To validate the search procedure, we also 
present a measurement of the ZZ production cross sec- 
tion in the same final state used for the Higgs boson 
search. 

We describe the DO detector in Sectionlllland the event 
selection in the four analysis channels in Section IIIII 
Background and signal MC simulations are detailed in 
Section IIVI and multijet estimation is described in Sec- 
tion |Vl In Section IVH we discuss the normalization ap- 
plied to the background samples. The kinematic fit is 
described in Section I VIII We describe the multivariate 
analysis strategy in Section I Villi and the systematic un- 
certainties affecting the final results in Section IIXI We 
present the results for Higgs boson production and dibo- 
son production in Section |X] and summarize our results 
m Section IXll 



II. THE DO DETECTOR 

The DO detector [l^, consists of a central tracking 
system in a 2 T superconducting solenoidal magnet, sur- 
rounded by a central preshower (CPS) detector, a liquid- 
argon sampling calorimeter, and a muon spectrometer. 
The central tracking system consists of a silicon mi- 
crostrip tracker (SMT) and a scintillating fiber tracker 
(CFT), and provides coverage for charged particles in 
the pseudorapidity [Ts] range |?7dct| < 3, where rydot is the 
pseudorapidity measured with respect to the center of the 
detector. The CPS is located immediately before the in- 
ner layer of the calorimeter, and has about one radiation 
length of absorber, followed by three layers of scintillat- 
ing strips. The calorimeter consists of a central cryostat 
(CC), covering |?7dct| < 1-1, and two end cryostats (EC), 
covering up to |»7dct| ~ 4.2. In each cryostat the calorime- 
ters are divided into electromagnetic (EM) layers on the 
inside and hadronic layers on the outside. Plastic scin- 
tillator detectors improve the calorimeter measurement 
in the inter-cryostat regions (ICRs, 1.1 < |?7dct| < 1-5) 
between the CC and the ECs. The muon spectrome- 
ter is located beyond the calorimeter and consists of a 
layer of tracking detectors and scintillation trigger coun- 
ters before a 1.8 T iron toroidal magnet, followed by two 
similar layers after the toroid. It provides coverage up to 
jr^detl ~ 2. The instantaneous luminosity is measured by 
a system composed of two disks of scintillators positioned 
in front of the ECs. A three-level trigger system selects 
events for data logging and subsequent offline analysis. 

III. EVENT SELECTION 

The search is performed in four independent channels 
defined by the subdetectors used for lepton identifica- 
tion: the dimuon channel (/^/i), the muon + isolated 
track channel (/i/^trk), the dielectron channel (ee), and 
the electron -I- ICR electron channel (eejcR). The data 



for this analysis were collected from April 2002 to Febru- 
ary 2006 (Run 2a), and from June 2006 to September 
2011 (Run 2b). Between Run 2a and Run 2b, a new 
layer of the SMT was installed and the trigger system 
was upgraded [lOj. Run 2a corresponds to an integrated 
luminosity of 1.1 fb^^. Run 2b is further sub-divided into 
three periods that we analyze independently to account 
for time-dependent effects in the performance of the de- 
tector. We refer to them as Runs 2bl (corresponding to 
an integrated luminosity of 1.2 fb"-*^), 2b2 (3.0 fb~^), and 
2b3 (4.4 fb-i). 

A. Triggering 

In the ee and eeicR channels we analyze events ac- 
quired predominantly with triggers that provide real- 
time identification of electrons and jets. In the ee chan- 
nel we accept events that satisfy any trigger requirement, 
with a measured efficiency consistent with 100% within 
1%. In the eeicR channel the set of triggers used has 
an efficiency of 90-100% depending on the region of the 
detector toward which the electron points, and we apply 
the trigger efhciency, measured in data and parametrized 
by electron 77, electron 0, and jet transverse momen- 
tum, to the MC events as a weight. Specific selection 
requirements applied to the two channels are described 
in Sec. UlIBl 

In the \x\i and /x/Xtrk channels we accept events that 
satisfy any trigger requirement, although most were 
recorded using triggers that contain muon selection 
terms. To correctly model the efficiency of the inclusive 
set of triggers for these events, we develop a correction 
based on a reference data sample, for which we demand 
that the leading muon with |?7dct| < 1-5 satisfies one of 
the triggers that require a single muon. We confirm that 
this reference sample is well modeled by the MC when 
we apply the corresponding trigger efficiencies. We then 
derive a normalization correction factor equal to the ra- 
tio of the number of events in the inclusively triggered 
sample to the single-muon trigger sample in bins of the 
number of jets in the event. Shape-only correction fac- 
tors are determined in zero-jet events in bins of 77 of each 
of the two muons and the transverse energy imbalance 
(^x)- To account for changes in the trigger conditions, 
and hence efficiency, with time, we derive separate cor- 
rections for each of the four data-taking periods. Figure 
[T] shows as an example the correction factors for the /i/i 
channel in Run 2b3. 

After imposing data quality requirements, the inte- 
grated luminosity recorded by these triggers is 9.7 fb""'^ 
in each channel. 



B. Offline Event Selection 

The event selection in all channels requires a in- 
teraction vertex (PV) that has at least three associated 
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FIG. 1: (color online). Trigger correction factors for the iifi channel in Run 2b3 as a function of (a) jet multiplicity, (b) (c) 
r^det of the triggered muon, and (d) rydet of the other muon. The correction applied to the single muon trigger is the product of 
all four components. 



tracks, and is located within ±60 cm of the center of the 
detector along the beam direction. In the dimuon channel 
(nil) we select events with at least two muons identified in 
the muon system, matched to central tracks with trans- 
verse momenta pT > 10 GeV and |77dot| < 2. At least 
one muon must have |77dct| < 1-5 and > 15 GeV. 
The two muons must also have opposite charges. The 
distance between the PV and each of the muon tracks 
along the z axis, dpy, must be less than 1 cm. The dis- 
tance of closest approach of each muon track to the PV 
in the plane transverse to the beam direction, dpv, must 
be less than 0.04 cm for tracks with at least one hit in 
the SMT. Muon tracks without any SMT hits must have 
dpy < 0.2 cm, and the momentum resolution of these 
tracks is improved through a constraint to the position 
of the PV in the transverse plane. 

At least one muon must be separated from all jets (see 
below) by ATI = ^(A?])^ + (Acj))'^ > 0.5, where the jets 
must have pt > 20 GeV and |?7dct| < 2.5. If only one 
muon satisfies this criterion, we also require that the ratio 
(_Rtrk) of the vector sum of the transverse momenta of 
all tracks in a cone of ATZ < 0.5 around that muon to 
its pt satisfy i?TRK < 0.2, and that the ratio (i?cAL) 



of the transverse energy deposited in the calorimeter in a 
hollow cone with 0.1 < ATZ < 0.4 around that muon to 
its Pt satisfy i?cAL < 0.2. If both muons are separated 
from jets, then only the leading muon must satisfy the 
additional track and calorimeter isolation requirements 
described above. To reduce contamination from cosmic 
rays, the muon tracks must not be back-to-back in rj and 

The fifitrk channel is designed to recover dimuon events 
in which one muon is not identified in the muon sys- 
tem, primarily because of gaps in the muon system cov- 
erage. In this channel we require the presence of exactly 
one muon with |77dct| < 1-5 and pt > 15 GeV that 
must satisfy the same tracker and calorimeter isolation 
requirements used for the fifi channel. We also require 
the presence of an isolated track with |77dot| < 2 and 
Pt > 20 GeV, separated from the muon by ATZ > 0.1. 
This track-only muon (//trk) must have at least one SMT 
hit, dpv < 0.02 cm, and dpy < 1 cm. It must be sepa- 
rated from all jets having px > 15 GeV and |?7dct| < 2.5 
by ATZ > 0.5. It must also satisfy the same tracker and 
calorimeter isolation requirements as the first muon. The 
muons must have opposite charges. To ensure that the 
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/i/^ and /i/xtrk selections do not overlap, we reject events 
that contain any additional muons with |77det| < 2 and 
Pt > 10 GeV. For the small fraction of events (approx- 
imately 0.1%) with more than one track passing these 
requirements, the track whose invariant mass with the 
muon is closest to the Z boson mass is chosen. 

In the dielectron (ee) channel we select events with at 
least two electrons with pT > 15 GeV that pass selection 
requirements based on the energy deposition and shower 
shape in the calorimeter and the CPS. Electrons are ac- 
cepted in the CC with |rydet| < 1-1 and in the EC with 
1.5 < |?ydot| < 2.5, but at least one of the electrons must 
be identified in the CC. Electrons are selected from EM 
clusters reconstructed within a cone of radius TZ — 0.2 
and satisfying the following requirements: (i) at least 
90% (97%) of the cluster energy is deposited in the EM 
calorimeter of the CC (EC); (ii) the calorimeter isolation 
variable / = [E^ot - E^m] / E^m is less than 0.09 (0.05) in 
the CC (EC), where E^^t is the total energy in a cone of 
radius TZ — 0.4 and i?EM is the EM energy in a cone of 
radius TZ = 0.2; (iii) the scalar sum of the transverse mo- 
menta of all tracks in a hollow cone of 0.05 < ATZ < 0.4 
around the electron is less than 4 GeV in the CC, and 
less than or equal to to 2 GeV in the EC, depending on 
r]det of the electron; (iv) the output of an artificial neural 
network - which combines the energy deposition in the 
first EM layer, track isolation, and energy deposition in 
the CPS - is consistent with that expected from an elec- 
tron; (v) CC electrons must match central tracks or a set 
of hits in the tracker consistent with that of an electron 
trajectory; and (vi) for EC electrons the energy-weighted 
cluster width in the third EM layer must be consistent 
with that expected from an EM shower. 

In the eeicR channel, events must contain exactly one 
electron in cither the CC or EC with px > 15 GeV, and 
a track pointing toward one of the ICRs, where electro- 
magnetic object identification is compromised. This ICR 
track must be matched to a calorimeter energy deposit 
with Et > 15 GeV. The ICR electron must satisfy a 
requirement on the output of a neural network, designed 
to separate electrons from jets, that combines the track 
quality, the track isolation and the energy deposition in 
the scintillator detectors located in the ICR. If the elec- 
tron is found in the EC, we require that the ICR elec- 
tron has the same rapidity sign. In both the ee and the 
eeicR channels, any tracks matched to electrons must 
have < 1 cm. 

We reconstruct jets in the calorimeter using an it- 
erative midpoint cone algorithm (20j with a cone of 
TZ = 0.5. The energies of jets are corrected for de- 
tector response, presence of noise and multiple pp inter- 
actions, and energy fiowing out of (into) the jet cone 
from particles produced inside (outside) the cone [2l| . In 
all lepton channels, jets must have pt > 20 GeV and 
|?7dot| < 2.5. To reduce the impact from multiple pp in- 
teractions at high instantaneous luminosities, jets must 
contain at least two associated tracks originating from 
the PV. We further require that each of these tracks have 



at least one hit in the SMT. Jets meeting these criteria 
are considered "taggable" by the 6-tagging algorithm de- 
scribed below. However, jets separated from electrons 
selected in the ee and eeicR channels by ATZ < 0.5 are 
excluded from the analysis, as they are considered to be 
reconstructed from calorimeter activity generated by the 
electrons themselves. 

We use "inclusive" to denote the event sample selected 
by requiring the presence of two leptons with an invari- 
ant mass 40 < M« < 200 GeV. We use "pretag" to de- 
note the sample that meets the additional requirements 
of having at least two taggable jets with p^ > 20 GeV 
and |7?dct| < 2.5, and 70 < M« < 110 GeV. 

To distinguish events containing a H ^ bb decay 
from background processes involving light quarks (uds), 
c quarks, and gluons, jets are identified as likely to orig- 
inate from the decay of b quarks {b tagged) if they pass 
"loose" or "tight" requirements on the output of a neu- 
ral network trained to separate b jets from light quark 
or gluon jets. This discriminant is an improved version 
of the neural network 6-tagging discriminant described 
in Ref. (2^ , using a larger number of input variables re- 
lated to secondary vertex information, as well as a more 
sophisticated multivariate strategy. The 6-jet tagging ef- 
ficiency for taggable jets with |?7| < 1.1 and pr ~ 50 GeV 
and the corresponding misidentification rate of light jets 
are 72% and 7% for loose b tags, and 47% and 0.4% for 
tight b tags. We classify events with at least one tight 
and one loose b tag as double-tagged (DT). If an event 
fails the DT requirement, but contains a single tight b 
tag, we classify it as single-tagged (ST). The H bb 
candidate is composed of the two highest-px tagged jets 
in DT events, and the tagged jet plus the highest-px non- 
tagged jet in ST events. 



IV. MONTE CARLO SIMULATION 

The dominant background process for the ZH search is 
the production of a Z/j* boson (referred to hereafter as 
a Z boson) in association with jets, with the Z boson de- 
caying to leptons (Z-fjets). The light- fiavor component 
(Z-I-LF) includes jets from only light quarks or gluons. 
The heavy-fiavor component {Z+KF) includes Z+bb and 
Z -h CC production. The Z-I-LF, Z + bb, and Z -\- cc back- 
grounds are generated separately, and overlaps between 
them are removed. The remaining backgrounds are from 
ti, diboson (WW, WZ, and ZZ) and multijet produc- 
tion with non-prompt muons or electrons, or with jets 
misidentified as electrons. 

We simulate ZH and diboson production with 
PYTHIA [l^]. In the ZH samples, we consider the l^l~bb, 
e+e-cc, and i+e-T+T" final states. The £+e-bb final 
state accounts for 99% (97%) of the signal yield in the 
DT (ST) sample. The Z-|-jets and tt processes are sim- 
ulated with ALPGEN [131. The events generated with 
ALPGEN use PYTHIA for parton showering and hadroniza- 
tion. Because this procedure can generate additional jets. 
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we use the MLM matching scheme [25| to avoid double 
counting partons produced by ALPGEN and those subse- 
quently added by the showering in PYTHIA. All simulated 
samples are generated using the CTEQ6L1 [2^ leading- 
order parton distribution functions (PDF). To simulate 
the underlying event, consisting of all particles not origi- 
nating from the hard scatter of interest in the pp collision, 
we use DO Tune A [21]. 

All samples are processed using a detector simulation 
program based on GEANtS [1^. Events from randomly 
chosen beam crossings with the same instantaneous lu- 
minosity distribution as the data are overlaid on the gen- 
erated events to model the effects of multiple pp interac- 
tions and detector noise. Finally, the simulated events 
are reconstructed using the same offline algorithms used 
to process the data. 

We take the cross section and branching ratios for the 
signal from Refs. fso', [3l|. For the diboson processes, 
we use next-to-leading order (NLO) cross sections from 
MCFM [s^ . We scale the inclusive Z boson cross sections 
to next-to-NLO [ssj, and apply additional NLO heavy- 
flavor correction factors, also calculated from mcfm, of 
1.52 and 1.67 to the normalizations of the Z + bb and 
Z + cc samples, respectively. For the ti background, we 
use the approximate next-to-NLO cross section [s^ . 



A. MC Corrections 

Jet energy calibration and resolution are corrected in 
simulated events to match those measured in data, and 
we smear the energies of simulated leptons to reproduce 
the resolution observed in data. We apply scale factors 
to MC events to account for differences in reconstruc- 
tion efficiency between the data and simulation for jets 
and leptons. We also correct the efficiency for jets to be 
taggable and to satisfy 6-tagging requirements in the sim- 
ulation to reproduce the respective efficiencies in data. 

To improve the modeling of the pt distribution of the 
Z boson, we reweight the simulated Z boson events to 
be consistent with the observed Z boson px spectrum in 
data [35]. In our signal samples, we correct the generator- 
level Pt of the ZH system to match the distribution from 

RESBOS di]. 

Additional corrections are applied to improve agree- 
ment between data and background simulation, using two 
control samples with negligible expected signal contribu- 
tions: the inclusive and pretag samples discussed in Sec- 
tion HHbI Motivated by a comparison of the AL PGEN jet 
angular distributions with those from data ^3 and the 
SHERPA generator [sl], we reweight the Z-|-jets events to 
improve the modeling of the distributions of the pseudo- 
rapidities of the two jets. The reweighting factors are cal- 
culated with the pretag sample as the ratio of the data to 
the sum of the simulated Z+LF and Z+HF backgrounds 
after having subtracted all other backgrounds from the 
data. Since the energy resolution for jets in the ICR dif- 
fers from the resolution for jets in the CC or EC, we ex- 



clude jets with 1.0 < |77det| < 1.6 when determining these 
reweighting factors and develop a separate reweighting 
for jets in the ICR. These corrections are parametrized 
in r] and display variations of up to 20%. After applying 
the corrections, we renormalize to the yield from ALP- 
GEN. 



V. MULTIJET BACKGROUND 

The multijet backgrounds are estimated from control 
samples in the data. The selection criteria in each chan- 
nel are nearly the same as for the inclusive sample, with 
the differences described below. For the ee channel, the 
electron isolation and shower shape requirements are re- 
versed. The multijet sample in the ee channel suffers 
from a bias due to trigger conditions towards tighter elec- 
tron identification criteria. The multijet background is 
therefore reweighted to correct for this bias, and a sys- 
tematic uncertainty is assigned to account for the un- 
certainty in the fit that calculates the correction. For 
the eeicR channel, the electron in the ICR must fail 
the neural network output requirement described in Sec- 
tion |lTrB] In the /i/i channel, a multijet event must con- 
tain a Z boson candidate that fails any of the isolation 
requirements. The two muons forming the Z boson can- 
didate must have the same charge. In the /i/^trk channel, 
the multijet sample must pass all selection criteria, ex- 
cept that the two muons should have the same charge. 
These samples are used to define templates that are nor- 
malized by the procedure described in Section I VII The 
multijet background comprises approximately 7% of both 
the ST and DT samples after normalization. 



VI. NORMALIZATION PROCEDURE 

We adjust the normalization of the multijet back- 
ground and all simulated background and signal samples 
using a simultaneous template fit of the dilepton mass 
(Mu) distributions in each channel, data-taking period, 
and jet multiplicity bin (njet = 0, 1, or > 2). This im- 
proves the accuracy of the background model and reduces 
the impact of some systematic uncertainties. The inclu- 
sive event sample is used so that we fit to the inclusive 
Z boson cross section, which is known with much greater 
accuracy than the Z + 2 jets cross section. The fit min- 
imizes the x^: 



D2 



ki-zu. + 02 



(1) 

where m runs over the bins of Mu, j runs over njot, and i 
indicates the channel. In the normalization fit we divide 
the ee channel into two sub-channels: CC-CC, in which 
both electrons are in the CC, and CC-EC in which one 
electron is in the CC and one electron is in the EC. We 
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also divide each channel into the four data-taking periods 
(Run 2a, Run 2bl, Run 2b2, and Run 2b3). 

The number of data events are D^, and the fit adjusts 
the normalization of QJ^, the multijet sample, Z^, the 
simulated Z boson (including Z + bb and Z + cc) sample, 
and Oil, other simulated samples. The fit parameters 
are the multijet scale factors a*^ that apply to Qmi the 
combined luminosity and efficiency scale factors kl for 
channel i that are applied to Z^l and , and the Z bo- 
son cross section scale factors fc^ that apply to Z^. The 
parameters a^^ are fixed to unity for the fifitrk channel, as 
the only criterion in this channel for multijet selection is 
that the two muons fail the opposite-charge requirement, 
and a jet is equally likely to fake a /i+ or a /i^. We also 
fix fc^ = 1; approximately equivalent to assuming that 
the inclusive Z boson cross section is known exactly. In 
the assessment of the systematic uncertainty from the 
background fit, fc^ is varied within the uncertainty on 
the inclusive Z boson cross section .30,] . 

The fc^ parameters are expected to be independent of 
data-taking periods, since these are the cross section scale 
factors for Z-|-jets production and any time-dependent 
detector effects should be absorbed by kl. However, we 
observe a discrepancy in /c^ between the Run 2a and 
Run 2b data, which we attribute to differences in jet 
reconstruction and identification algorithms between the 
two epochs. For this reason, we perform two separate 
fits for the /c^- 1) using the Run 2a period only, and 2) 
using the Run 2b period only, but keeping the separation 
between Run 2bl, Run 2b2, and Run 2b3 for the other 
parameters. We assign a systematic uncertainty on the 
Run 2a normalization to account for this discrepancy. 
Tables [T] and [Til show the results of the fits for Run 2a 
and Run 2b, respectively. In Section |IX] we discuss the 
uncertainties arising from the normalization procedure. 

TABLE I: Parameters from the combined normalization fit 
for Run 2a. Statistical uncertainties are less than 1%, and 
systematic uncertainties are on the order of 5%. There are 
no uncertainties for a'-' for the /i/itrk channel or for k% since 
they are fixed. 



Channel 


kl 


a'" 


a a 


Run 2a 


eeCC-CC 


1.03 


0.34 


0.29 0.14 


eeCC-EC 


1.01 


0.33 


0.27 0.29 


eeicR 


1.02 


0.12 


0.07 0.01 




0.93 


1.4 


0.46 0.44 




0.91 


1 


1 1 







'^z 


1 


0.97 


1.06 



As a cross-check, we repeat the fit for each channel in- 
dependently, and find the results to be consistent with 



TABLE II: Parameters from the combined normalization fit 
for Run 2b. Statistical uncertainties are less than 1%, and 
systematic uncertainties are on the order of 5%. There are 
no uncertainties for a'^ for the /^/itrk channels or for k% since 
they are fixed. 



Channel 


kl 


a'" 


a 


a 


Run 2b 1 


eeCC-CC 


0.99 


0.18 


0.13 


0.14 


eeCC-EC 0.97 


0.17 


0.15 


0.15 


eeicR 


0.97 


0.11 


0.08 


0.10 


MM 


0.97 


1.4 


0.44 


0.31 




1.04 


1 


1 


1 


Run 2b2 


eeCC-CC 


1.02 


0.10 


0.11 


0.14 


eeCC-EC 


1.01 


0.099 


0.11 


0.14 


eeicR 


0.92 


0.077 0.065 


0.061 




0.98 


1.5 


0.41 


0.41 


MMtrk 


1.03 


1 


1 


1 


Run 2b3 


eeCC-CC 


1.04 


0.13 


0.12 


0.13 


eeCC-EC 


1.04 


0.12 


0.11 


0.11 


eeicR 


1.01 


0.080 


0.071 


0.061 


MM 


0.99 


1.2 


0.44 


0.35 


MMtik 


1.01 


1 


1 


1 



l^Z 


i^z 




1 


0.90 


0.94 



the simultaneous fit. We assign the RMS of the observed 
deviations from the combined fit as a systematic uncer- 
tainty. 

Table IIIII gives the number of events observed in the 
inclusive, pretag, ST and DT samples, and the expected 
number of events for the different background compo- 
nents and the signal (assuming Mu = 125 GeV), fol- 
lowing all MC corrections and the normalization fit. 

Figure [2] shows the jet multiplicity distribution in the 
inclusive sample for the combination of all channels. The 
dimuon and dielectron mass spectra in the pretag sample 
are shown in Fig. [31 In Figs. [Jj and [SJ we show distri- 
butions of the transverse momenta of the two jets with 
the highest j3t and the invariant mass of the dijet system 
constructed from those two jets. In all plots, data points 
are shown with error bars that reflect statistical uncer- 
tainty only, and discrepancies in data-MC agreement are 
within the systematic uncertainties described in Sec. IIXI 



VII. KINEMATIC FIT 

We use a kinematic fit to improve the resolution of the 
dijet invariant mass. The fit varies the energies and an- 
gles of the two leptons from the Z boson candidate, and 
of the two jets that form the Higgs boson candidate (and 
of a third jet, if present) within their experimental res- 
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TABLE III: Expected and observed event yields for all lepton channels combined after requiring two leptons (inclusive), 
after also requiring at least two taggable jets and 70 < Mti < 110 GeV (pretag), and after requiring exactly one (ST) or at 
least two (DT) b tags. The ZH yields are given for Mh = 125 GeV. Expected yields are obtained following the background 
normalization procedure described in Section IVII The uncertainties quoted on the total background and signal include all 
systematic uncertainties and uncertainties from limited MC statistics. 

Data Total Background MJ Z+LF Z+UF Diboson tt ZH 

Inclusive 1845610 1841683 160746 1630391 46462 2914 1170 17.3 ± 1.1 

Pretag 25849 25658 1284 19253 4305 530 285 9.2 ± 0.6 

ST 886 824 ± 102 54 60 600 33 77 2.5 ± 0.2 

DT 373 366 ±39 25.7 3.5 219 19 99 2.9 ± 0.2 



-Inclusive 



D0, 9.7 fb 
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■ z+bb BZ+cc 

HTop I Diboson 
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3 4 5 

Number of jets 



FIG. 2: (color online). Jet multiplicity distribution in the 
inclusive sample, summed over all lepton channels, along 
with the background expectation. The signal distribution for 
Mh = 125 GeV is scaled by a factor of 500. 



olutions, subject to three constraints: the reconstructed 
dilepton mass must be consistent with the Z boson mass 
and the x and y components of the vector sum of the 
transverse momenta of the leptons and jets must be con- 
sistent with zero. 

The fit minimizes a negative log likelihood function: 



InLfit = -^ln/^(?/: 



obs prcd 



3 



(2) 



where Cj {j = 1,2,3) are the probability densities for 
kinematic constraints, and fi is the probability density 
(transfer function) for observable y°^^ whose predicted 
value is yf"^*"^- The fit contains twelve independent ob- 
servables for events with two jets: four particles x three 
variables {E or 1/pt, V and (f>). For events with three 
jets, there are fifteen observables. 

The probability density for the Z boson mass con- 
straint is a Breit-Wigner function using the values for 
the mass and width of the Z boson from Ref. [39l| . The 
constraints on the total transverse momentum compo- 
nents are Gaussian distributions with a mean of zero and 
a width of 7 GeV, as determined from the simulated ZH 
samples. 

We use Gaussian transfer functions for all observables 
except the energies of the jets. In this case we use three 



sets of transfer functions, derived from MC studies for: 
(i) jets that originate from a b quark and do not contain a 
muon, (ii) jets that originate from a b quark and contain 
a muon, and (iii) jets that originate from a light quark or 
gluon. For the jets that form the Higgs boson candidate 
we use one of the b quark transfer functions, depending 
on whether they contain a reconstructed muon. For the 
third jet, if present, we use the light-quark transfer func- 
tion. 

The kinematic fit improves the dijet mass resolution 
by 10—15%, depending on Mh- The resolution for 
Mh = 125 GeV is approximately 15 GeV (i.e. 12%) after 
the fit. Distributions of the dijet invariant mass spec- 
tra, before and after adjustment by the kinematic fit, are 
shown in Fig. [S] 



VIII. MULTIVARIATE ANALYSIS 

We use a two-step multivariate analysis strategy based 
on random forest discriminants (RF), an ensemble clas- 
sifier that consists of many decision trees [40l| , as imple- 
mented in the tmva software package [4l|, to improve 
the discrimination of signal from background. In a first 
step, we train a dedicated RF (ti RF) that considers ti 
as the only background and ZH as the signal. This ap- 
proach takes advantage of the distinctive signature of the 
ti background, for instance the presence of large S^t- In 
a second step, we use the tt RF output to define two in- 
dependent regions: a tt-enriched region and a tt-depleted 
region. In each region, we train a global RF to separate 
the ZH signal from all backgrounds. In both steps we 
consider ST and DT events separately and train the dis- 
criminants for each value of the tested Higgs boson mass 
in the range 90 < Mh < 150 GeV in steps of 5 GeV. 



Compared to the result described in Ref. [15|, this two- 
step strategy improves sensitivity to the signal by 5-10%, 
depending on Mh- 

The input variables used for the multivariate analysis 
include the transverse momenta of the two 6-jet candi- 
dates and the dijet mass, before and after the jet ener- 
gies are adjusted by the kinematic fit, angular differences 
between the jets, between the leptons, and between the 
dijet and dilepton systems, the opening angle between 
the proton beam and the Z boson candidate in the rest 
frame of the Z boson, cos 6* and composite kine- 
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FIG. 3: (color online). The dilepton mass spectra, along with the background expectation, for the (a) fifi, (b) fi^tA, (c) ee and 
(d) eeicR channels in the pretag sample. The signal distributions for Mr = 125 GeV are scaled by a factor of 500. 
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FIG. 4: (color online). The pr spectra of the (a) leading and 
(b) sub-leading jets, along with the background expectations, 
summed over all lepton channels in the pretag sample. The 
signal distributions, for Mr = 125 GeV, are scaled by a factor 
of 500. 
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FIG. 5: (color online). Distribution of the dijet invariant 
mass, along with the background expectation, summed over 
all lepton channels in the pretag sample. The signal distribu- 
tion, for AIh ~ 125 GeV, is scaled by a factor of 500. 
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matic variables, such as thepr of the dijet system and the 
scalar sum of the transverse momenta of the leptons and 
jets. Table ITVl provides a complete list of input variables. 
We show selected distributions of the input variables in 
Figs. [7] and [8] for ST and DT events, respectively. 

To avoid biases in the training procedure, we divide the 
MC samples into three independent sub-samples: 25% of 
the events are used to train the RFs (for both the ti RF 
and the global RF); 25% of the events are used to test 
the RF discrimination performance and check for over- 
training (for both the ti RF and the global RF) , and the 
remaining 50% of the events (the evaluation sub-sample) 
are used for the statistical analysis to obtain Higgs boson 
cross section limits. 

Figures [9] and [101 show the pretag distributions of the ti 
RF and the global RF outputs, respectively, trained for 
Mh — 125 GeV. Figures [TTIIT^ show the corresponding 
distributions after applying the 6-tagging requirements 
for several different values of Mh- The requirement that 
separates the t?-depleted region {ti RF < 0.5) and the 
tt-enriched region {ti RF > 0.5) is shown in Figs. [HI and 

HH 



FIG. 6: (color online). The dijet invariant mass for the simu- 
lated ZH signal, at Mh = 125 GeV, summed over all lepton 
channels in the pretag sample, shown before and after the 
kinematic fit. 



IX. SYSTEMATIC UNCERTAINTIES 

We assess the impact of systematic uncertainties on 
both the normalization and shape of the predicted global 
RF distributions for the signal and for each background 
source. We summarize the magnitude of these uncer- 
tainties in Tables [Vl- 1 VIII and provide additional details 
below. Unless otherwise stated, we consider each source 
of systematic uncertainty to be 100% correlated for each 
process across all samples. 

The uncertainties on the integrated luminosity and the 
lepton identification efficiencies are absorbed by the un- 
certainties on the normalization procedure described in 
Section [VTl The uncertainties on the normalization of the 
multijet background are determined from the statistical 
uncertainties on the fit, typically around 10%. These are 
uncorrelated across channels but are correlated within a 
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FIG. 7: (color online). Distributions in ST events of (a) the dijet invariant mass corrected by the kinematic fit, (b) the pr of 
the leading jet from the Higgs boson candidate, (c) the pr of the sub-leading jet from the Higgs boson candidate, (d) the pT 
of the dijet system divided by the scalar sum of the transverse momenta of the two jets, (e) the colinearity of the two leptons, 
and (f) cos 6'* '42']. The signal distributions for Mh = 125 GeV are scaled by a factor of 100. 



channel (i.e., between the different 6-tag samples, and 
between the tt-depleted and enriched regions). We com- 
pare the value of from the combined normalization to 
the values obtained from independent fits in each channel 
We assess an uncertainty for each channel that is equal to 
the RMS (3-5%) of the observed deviations. This uncer- 
tainty is taken to be uncorrelated across channels. The 
normalization of the Z-|-jets background to the pretag 
data constrains that sample within the statistical uncer- 



tainty (1-2%) of the pretag data. Since this sample is 
dominated by the Z+LF background, the normalization 
of the ti, diboson, and ZH samples acquires a sensitivity 
to the inclusive Z boson cross section, for which we as- 
sess a 6% uncertainty [s^ . We assign this uncertainty to 
these samples as a common uncertainty. We apply a 9% 
uncertainty to the Run 2a prediction of Z+LF produc- 
tion to account for the different values of fc| obtained for 
Run 2a and Run 2b. For Z+RF production, we evalu- 
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FIG. 8: (color online). Distributions in DT events of (a) the dijet invariant mass corrected by the kinematic fit, (b) the pr of 
the leading jet from the Higgs boson candidate, (c) the pr of the sub-leading jet from the Higgs boson candidate, (d) the pT 
of the dijet system divided by the scalar sum of the transverse momenta of the two jets, (e) the colinearity of the two leptons, 
and (f ) cos 9* . The signal distributions for Mh = 125 GeV are scaled by a factor of 50. 



ate a cross section uncertainty of 20% based on Ref. [32| . 
For the diboson and tt backgrounds, we take the uncer- 
tainties on the cross sections to be 7% [111 and 10% [sl j. 
respectively. The cross section uncertainty for the signal 
is 6% [13. 

Sources of systematic uncertainty affecting the shapes 
of the final discriminant distributions are the jet energy 
scale, jet energy resolution, jet identification efficiency, 
and ^-tagging efficiency. Shape uncertainties are assessed 



by repeating the full analysis with each source of uncer- 
tainty varied by ±1 s.d. Other sources include trigger 
efficiency, multijet modeling in the ee channel, PDF un- 
certainties [3 J data-determined corrections to the model 
for Z+jets, modeling of the underlying event, the MLM 
matching applied to ALPGEN Z+LF events ;25] , and from 
varying the factorization and renormalization scales for 
the ALPGEN Z-fjets simulation. 
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TABLE IV: Variables used for the tt and global RF training. The jets that form the Higgs boson candidate are referred to as 
61 and 62, ordered in pt- 

variables tt RF global RF 



Invariant mass of the dijet system before (after) the kinematic fit %/ *\/ 

Transverse momentum of the first jet before (after) kinematic fit V V 

Transverse momentum of the second jet before (after) kinematic fit V V 

Transverse momentum of the dijet system before the kinematic fit V V 

A(j) between the two jets in the dijet system — ^y 

Arj between the two jets in the dijet system — 
Invariant mass of all jets in the event 
Transverse momentum of all jets in the event 

Scalar sum of the transverse momenta of all jets in the event — 

Ratio of dijet system pr over the scalar sum of the pr of the two jets (pt /(Ipt I + IPt I)) V ~ 
Invariant mass of the dilepton system — 

Transverse momentum of the dilepton system ^y y/ 
/S.<j) between the two leptons -J 
cosine of the angle between the two leptons (colinearity) s/ 

between the dilepton and dijet systems 
cosine of the angle between the incoming proton and the Z in the zero momentum frame (cos 6'*) [42] — ^ 

Invariant mass of dilepton and dijet system — 

Scalar sum of the transverse momenta of the leptons and jets — -J 
Missing transverse energy of the event — 

significance [43] 

Negative log likelihood from the kinematic fit (Eq. [T]) V 
U RF output - \l 



X. RESULTS 

We use the global RF output distributions of the 
four sub-samples (ST and DT in the tt-depleted and ti- 
enriched regions) in each channel along with the cor- 
responding systematic uncertainties to extract results 
for both Higgs boson production and diboson produc- 
tion. The use of separate channels and sub-samples 
takes advantage of the sensitivity from the signal-rich 
sub-samples and allows for a better background assess- 
ment based on the signal-poor sub-samples. The binning 
of each distribution is chosen such that the statistical un- 
certainty for each bin is less than 20% for the signal-plus- 
background prediction and 25% for the background-only 
prediction. 

We evaluate the consistency of the data with the 
background-only (B) and signal-plus-background {S + 
B) hypotheses using a modified frequentist (CL5) 
method [i^. This method uses the negative log likeli- 
hood ratio LLR — —2\ii{Ls+b/ Lb), where Lg+B and 
Lb are the Poisson likelihoods for the S + B and the B 
hypotheses, respectively. 

We combine our results by summing the LLR over all 
bins of all contributing channels and sub-samples. The 
signal and background predictions are functions of nui- 
sance parameters that account for the presence of sys- 
tematic uncertainties. We maximize Ls+b with respect 
to the S + B hypothesis and L b with respect to the B 
hypothesis with independent fits that allow the sources of 
nuisance parameters to vary within Gaussian priors (46| . 
The maximized values oi Lb and Ls+b are then used in 
the calculation of the LLR. 



We integrate the LLR distributions obtained from B 
and S + B pseudo-experiments to obtain the p-values 
CLb and CLs+b for the two hypotheses. If the data 
are consistent with the B hypothesis, we exclude values 
of the product of the ZH production cross section and 
branching ratios for which CL5 = CLs+b /CLb < 0.05 
at the 95% C.L. 



A. Results for Diboson Production 

To validate the search procedure, we search for ZZ 
production in the £^i~bb final state. We use the same 
event selection, corrections to our background models, 
normalization fit parameters, RF training procedure, and 
statistical analysis methods as for the ZH search. Our 
search also includes contributions from ZZ — > £+£~cc 
and WZ production in the cs£^i~ final state where the 
c jet passes the ^-tagging requirement. We collectively 
refer to them as VZ production. The WW process is 
considered to be background. 

Figure [14] compares the LLR value observed in the 
data to distributions obtained from B and S + B pseudo- 
experiments. To obtain ayz in units of the SM value, 
we maximize Ls+b with respect to the nuisance param- 
eters and a signal scale factor /, keeping the ratio of 
the ZZ and WZ cross sections fixed to the SM pre- 
diction. We find / = 0.8 ± 0.6, which translates to 
o'vz = 3.5±2.5 pb given the predicted total SM cross sec- 
tion of avz — 4.4 ± 0.3 pb [32l- Figure ITSl compares this 
result to the SM cross section and to the distribution of 
results obtained from B and S + B pseudo-experiments. 
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FIG. 9: (color online). The tt RF output {Mr = 125 GeV) for all lepton channels combined in the pretag sample (a) trained 
for ST events and (b) trained for DT events. The arrows indicate the tt RF selection requirement used to define the tt-enriched 
and depleted sub-samples. The signal distributions for Mh = 125 GeV are scaled by a factor of 500. 
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FIG. 10: (color online). Global RF output {Mr = 125 GeV) for all lepton channels combined for (a) pretag events evaluated 
with the ST-trained RF and (b) pretag events evaluated with the DT-trained RF. The signal distributions for A4h = 125 GeV 
are scaled by a factor of 500. 



The probability (p-value) that the B hypothesis results 
in a cross section greater than that determined from the 
data is 0.071, equivalent to 1.5 standard deviations (s.d.). 
The expected p-value is 0.032, corresponding to 1.9 s.d. 
In Figs. [16] and [IT] we show the global RF and post- 
kinematic fit dijet mass distributions after the likelihood 
fit, separately for ST and DT events in the tt-depleted re- 
gion. The diboson signal consists of 66% (93%) ZZ pro- 
duction and 34% (7%) WZ production in the ST (DT) 
sample. 



B. Higgs Boson Search Results 

In Figs . [T51 and [TOl we show the global RF distributions 
for AIh = 125 GeV after the fit of the nuisance param- 
eters to the data in the B hypothesis. Figure [20] shows 
the observed and expected (median) LLR values for the 
individual analysis channels. Also shown are the upper 
limits at the 95% C.L. on the product of the ZH produc- 



tion cross section and branching ratio for H — ^ bb. The 
LLR values for all lepton channels combined are shown 
in Fig. [2lT a). and limits are shown in Fig. [2lT b) and Ta- 
ble IVIIII The limits are expressed as a ratio to the SM 
prediction. At Mh — 125 GeV the observed (expected) 
limit on this ratio is 7.1 (5.1). 



XI. SUMMARY 

In summary, we have searched for SM Higgs boson 
production in association with a Z boson in the final 
state of two charged leptons (electrons or muons) and 
two fo-quark jets using 9.7 fb~^ of pp collisions at ^/s = 
1.96 TeV. To validate the methods used in this analysis, 
we have determined the cross section for ZZ production 
in the same final state and found it to be a factor of 
0.8±0.6 relative to the SM prediction, with a significance 
of 1.5 s.d. We have set an upper limit on the product of 
the ZH production cross section and branching ratio for 
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FIG. 11: (color online). The tt RF output for all lepton channels combined in ST and DT events for Mh = 115 GeV (a, b), 
for Mh ~ 125 GeV (c, d), and for Mh = 135 GeV (e, f). The signal distributions correspond to the Mh used for the RF 
training and are scaled by a factor of 40. The arrows indicate the tt RF selection requirement used to define the tt-enriched 
and depleted sub-samples. 



iJ ^ 66 as a function of Mh- The observed (expected) 
limit at the 95% C.L. for Mh = 125 GeV is 7.1 (5.1) 
times the SM expectation. 



Acknowledgments 

We thank the staffs at Fermilab and collaborating 
institutions, and acknowledge support from the DOE 



and NSF (USA); CEA and CNRS/IN2P3 (France); 
FASI, Rosatom and RFBR (Russia); CNPq, FAFERJ, 
FAPESP and FUNDUNESP (Brazil); DAE and DST (In- 
dia); Colciencias (Colombia); CONACyT (Mexico); KRF 
and KOSEF (Korea); CONICET and UBACyT (Ar- 
gentina); FOM (The Netherlands); STFC and the Royal 
Society (United Kingdom); MSMT and GACR (Czech 
Republic); CRC Program and NSERC (Canada); BMBF 
and DEC (Germany); SFI (Ireland); The Swedish Re- 



17 



- (a) Single Tag 




D0, 9.7 fb "^ 

• Data 
Z+LF 

■ z+bb 
Z+cc 

Top 

Diboson 
Multijet 
ZH X 40 
M„=n5 GeV 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Global RF Output 



140 
120 
100 
80 
60 
40 
20 




r(c) Single Tag 

tf RF > 0.5 
I 
I 




D0, 9.7fb "^ 

• Data 

Z+LF 

Z+bb 
z+cc 
Top 

Diboson 
Multijet 
ZH X 40 

M,,=125 GeV 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Global RF Output 



140 
120 
100 
80 
60 
40 
20 




^(e) Single Tag 

tf RF > 0.5 
i I 




D0,9.7fb"^ 

• Data 

Z+LF 

Z+bb 
z+cc 
Top 

Diboson 
Multijet 
ZH X 40 

M,,=135 GeV 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Global RF Output 



o 



> 
pq 



m 
o 



> 

pq 



m 
o 



<u 
> 
pq 



50 
40 
30 
20 
10 


60 
50 
40 
30 
20 
10 


70 
60 
50 
40 
30 
20 
10 




(b) Double Tag 
-tf RF > 0.5 



D0, 9.7 fb "^ 

• Data — Z+LF 
■ z+bb BZ+cC 
""Top Diboson 
BMultijet-...ZHx40 

M,,=115 GeV 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Global RF Output 



r(d) Double Tag 
tf RF > 0.5 




D0, 9.7fb "^ 

• Data 

Z+LF 

Z+bb 
z+cc 
Top 

Diboson 
Multijet 

ZH X 40 

M,,=125 GeV 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Global RF Output 



\ (f) Double Tag 


D0,9.7fb"^ 






• Data 


: tt RF > 0.5 


Z+LF 






z+bb 


II 




z+cc 






Top 






Diboson 






Multijet 






ZH X 40 






M„=135 GeV 




I 1 • 1 1 i 

II 9 1 ' 





0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Global RF Output 



FIG. 12: (color online). Global RF distributions for ST and DT events in the tt-depleted region for Mh = 115 GeV (a, b), for 
Mh — 125 GeV (c, d), and for Mh — 135 GeV (e, f). The signal distributions correspond to the Mh used for the RF training 
and are scaled by a factor of 40. 
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FIG. 13: (color online). Global RF distributions for ST and DT events in the tt-enriched region for Mh = 115 GeV (a, b), for 
AIh ~ 125 GeV (c, d), and for AIh ~ 135 GeV (e, f). The signal distributions correspond to the Mh used for the RF training 
and are scaled by a factor of 400. 
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TABLE V: Systematic uncertainties that are common across all sub-samples. Systematic uncertainties for ZH production 
shown in this Table are obtained for Mh = 125 GeV. Relative uncertainties are given in percent. When two numbers are given, 
the first is for Run 2b and the second is for Run 2a. 



Relative uncertainties (%) 



Contribution 
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Z + bb 
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tt 


Multijet Normalization 




10 












k% Uncertainty 


1.6 / 6.9 










1.6 / 6.9 


1.6 / 6.9 


fc| Uncertainty 






0.7 / 1.8 


0.7 / 1.8 


0.7 / 1.8 






kl RMS 


5.1 / 3 




5.1 / 3 


5.1 / 3 


5.1 / 3 


5.1 / 3 


5.1 / 3 


Run 2a Normalization 


-/9 
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Theoretical Cross Sections 
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TABLE VI: Systematic uncertainties on ST events in the tt-dcpleted and enriched regions. Systematic uncertainties for ZH 
production shown in this Table arc obtained for Mh = 125 GeV. Relative uncertainties are given in percent. As these 
uncertainties change the shape of the global RF distributions, the numbers refer to average per-bin changes. When a range is 
given, the uncertainty varies by Z boson decay channel. 
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Relative uncertainties (%) in the tt-enriched region for ST events 
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TABLE VII: Systematic uncertainties on DT events in the tt-depleted and enriched regions. Systematic uncertainties for 

ZH production shown in this Tabic arc obtained for A/h = 125 GcV. Relative uncertainties are given in percent. As these 
uncertainties change the shape of the global RF distributions, the numbers refer to average per-bin changes. When a range is 
given, the uncertainty varies by Z boson decay channel. 



ZH — >• Ubh Relative uncertainties (%) in the ff-depleted region for DT events 
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ZH iibb Relative uncertainties (%) in the tt-enriched region for DT events 
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TABLE VIII: The expected and observed upper limits at the 95% C.L. on the SM Higgs boson production cross section for 
ZH — > t^l~bb, expressed as a ratio to the SM cross section. 



Mh (GeV) 90 


95 


100 


105 


110 


115 


120 


125 
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140 


145 


150 


Expected 2.6 
Observed 1.8 
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4.3 
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16 
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FIG. 16: (color online). Global RF output distributions for the VZ search after the fit to data in the S + B hypothesis in 
(a) ST events, (c) DT events, and (e) ST and DT events combined. Distributions are summed over all Z ^ ££ channels. The 
VZ signal distribution, scaled to the measured avz, is compared to the data after subtracting the fitted background in (b) ST 
events, (d) DT events, and (e) ST and DT events combined. Data points are shown with Poisson statistical errors. Also shown 
is the uncertainty on the background after the fit. 



23 





200: 


O 


180^ 


o 


160; 


(N 


140^ 


•4— ' 

c 


120; 


> 


100^ 




so; 




60^ 




40; 




20; 




i 







(a) Single Tag 
tt RF > 0.5 




D0, 9.7 fb"' 
S+B Fit 
+Data iVZ 

-Z+LF^ Z+bb 
■z+cc ^ Top 
iWW iMultijet 



> 

O 
o 

(N 



> 



50 100 150 200 250 300 350 400 
Dijet mass (Kinematic Fit) (GeV) 



> 70 

60 
o 

^ 50 

1 40 
w 30 

20 
10 




(c) Double Tag 
tt RF > 0.5 



D0, 9.7 fb"' 
S+B Fit 
+Data iVZ 

-Z+LFfcZ+bb 
■z+cc ^Top 
LWW iMultijet 



> 

O 
o 

(N 



> 



50 100 150 200 250 300 350 400 
Dijet mass (Kinematic Fit) (GeV) 



(e) ST + DT 
tf RF > 0.5 




D0, 9.7 fb 
S+B Fit 
+Data iVZ 

-Z+LF|z+bb 
■z+cc ITop 
LWW IMultijet 



50 100 150 200 250 300 350 400 
Dijet mass (Kinematic Fit) (GeV) 



> 



40 
30 
20 
10 

-10 
-20 



(b) D0, 9.7 fb"^ 



Single Tag 
tt RF > 0.5 




S+B Fit 



-+-Data- Bkgd 
■ VZ 

— Bkgd± 1 s.d. 



50 100 150 200 250 300 350 400 
Dijet mass (Kinematic Fit) (GeV) 



20 
15 
10 
5 

-5 
-10 
-15 



(d) D0, 9.7 fb" 



Double Tag 
tt RF > 0.5 








-f- Data - Bkgd 
■ VZ 

S+B Fit — Bkgd ±1 s.d. 

50 100 150 200 250 300 350 400 
Dijet mass (Kinematic Fit) (GeV) 



60 
50 
40 
30 
20 
10 

-10 



(f) ST + DT 
tt RF > 0.5 



D0, 9.7 fb"^ 
-+- Data - Bkgd 
■ VZ 

— Bkgd± 1 s.d. 




50 100 150 200 250 300 350 400 
Dijet mass (Kinematic Fit) (GeV) 



FIG. 17: (color online). Dijet invariant mass distributions for the VZ search after the kinematic fit and after the fit to the 
data in the S + B hypothesis in (a) ST events, (c) DT events, and (e) ST and DT events combined. Distributions are summed 
over all Z ^ ££ channels. The VZ signal distribution, scaled to the measured ayz, is compared to the data after subtracting 
the fitted background in (b) ST events, (d) DT events, and (e) ST and DT events combined. Also shown is the uncertainty on 
the background after the S + B fit. 
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FIG. 18: (color online). Global RF output distributions in the tt-depleted region, assuming Mh = 125 GeV, after the fit to 
the data in the B hypothesis for (a) ST events, (c) DT events, and (e) ST and DT events combined. Background-subtracted 
distributions for (a), (c), and (e) are shown in (b), (d), and (f), respectively. Signal distributions for AIh ~ 125 GeV are shown 
with the SM cross section scaled by a factor of 5 in (b), (d), and (f). 
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FIG. 19: (color online). Global RF output distributions in the tt-enriched region, assuming Mh = 125 GeV, after the fit to 
the data in the B hypothesis for (a) ST events, (c) DT events, and (e) ST and DT events combined. Background-subtracted 
distributions for (a), (c), and (e) are shown in (b), (d), and (f), respectively. Signal distributions for AIh ~ 125 GeV are shown 
with the SM cross section scaled by a factor of 50 in (b) , (d) and (f ) . 
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FIG. 20: (color online). Observed and expected LLK values for the S + B and B hypotheses, along with the ±1 and ±2 
s.d. bands for the B hypotheses, as well as observed and expected cross section upper limits (along with the ±1 and ±2 s.d. 
bands for the expected limit) relative to the SM cross section, (a, b) for the ee channel, (c, d) for the fifi channel, (e, f) for the 
eeicR channel, and (g, h) for the /i/itrk channel. 
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FIG. 21: (color online), (a) Observed and expected LLR values as a function of Mh for the S + B and B hypotheses, along 
with the ±1 and ±2 s.d. bands for the B hypotheses, for all lepton channels combined, (b) Expected and observed cross section 
upper limits at the 95% C.L. for ZH — >■ t^l~bh production, relative to the SM cross section. 



